Issues in Translating Verb-Particle Constructions from German to English
Abstract
In this paper, we investigate difficulties in translating verb-particle constructions (VPCs) from German to English. We analyse the structure of German VPCs and compare them to VPCs in English. To find out whether and to what degree the presence of VPCs causes problems for statistical machine translation systems, we collected a set of 59 verb pairs, each consisting of a German VPC and a synonymous simplex verb. With this data, we constructed 118 sentences in which the simplex verb and the VPC are completely substitutable, and translated the resulting dataset into English using Google Translate and Bing Translator. Through an analysis of the resulting translations, we are able to show that the quality of the results decreases when translating sentences that contain VPCs instead of simplex verbs, in particular when the particle is separated from the verb root.
Similar resources
Project Proposal: How Verb-Particle Constructions Affect Translation Quality in English and German
This project deals with verb-particle constructions (VPCs) in the context of statistical machine translation, with the purpose of finding out if and to what degree they affect translation quality in English and German. Its main goal is to identify which types of VPCs cause translation errors and to find possible solutions to this problem. VPCs are a type of multiword expression (MWE), meaning...
Discontinuous Verb Phrases in Parsing and Machine Translation of English and German
In this paper, we focus on the verb-particle (V-Prt) split construction in English and German and its difficulty for parsing and Machine Translation (MT). For German, we use an existing test suite of V-Prt split constructions, while for English, we build a new and comparable test suite from raw data. These two data sets are then used to perform an analysis of errors in dependency parsing, word-...
How to Account for Idiomatic German Support Verb Constructions in Statistical Machine Translation
Support-verb constructions (i.e., multiword expressions combining a semantically light verb with a predicative noun) are problematic for standard statistical machine translation systems, because SMT systems cannot distinguish between literal and idiomatic uses of the verb. We work on the German to English translation direction, for which the identification of support-verb constructions is chall...
Automatic Extraction of Synonyms for German Particle Verbs from Parallel Data with Distributional Similarity as a Re-Ranking Feature
We present a method for the extraction of synonyms for German particle verbs based on a word-aligned German-English parallel corpus: by translating the particle verb to a pivot, which is then translated back, a set of synonym candidates can be extracted and ranked according to the respective translation probabilities. In order to deal with separated particle verbs, we apply re-ordering rules to...
Transitive Particle Verbs in English: The Neutral Order Evidence from Speech Production
With regard to the splitting of the verb-particle complex, the behaviour of English PVs differs from that of their German, Dutch, and Scandinavian counterparts. It is assumed in generative circles that the complex verb is base-generated in the VP-final position in the V2 languages German and Dutch. In V2 contexts such as main clauses the verb obligatorily moves syntactically into the highest fu...
Publication date: 2014